Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization

نویسنده

  • Hagai Aronowitz
چکیده

This paper presents a novel framework for unsupervised compensation of intra-session intra-speaker variability in the context of speaker diarization. Audio files are parameterized by sequences of GMM-supervectors representing overlapping short segments of speech. Session-dependent intra-session intra-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisance attribute projection (NAP) method. The proposed compensation method is evaluated in the context of speaker diarization in two-speaker conversations. A simple and effective twospeaker diarization algorithm is introduced in which speaker diarization is performed in the compensated supervectorspace. The proposed diarization algorithm was evaluated on summed telephone conversations and achieved a speaker error rate of 2.8% which is a 54% relative error reduction compared to a baseline BIC-based system. Finally, we evaluate the proposed system on a speaker recognition task in the summedspeech condition where improvement in speaker recognition accuracy is observed using the proposed diarization system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Diarization Based on Gmm Supervectors and Unsupervised Intra-speaker Variability Modeling

This paper presents a novel framework for speaker diarization. Audio is parameterized by a sequence of GMM-supervectors representing overlapping short segments of speech. Session dependent intra-session intra-speaker variability is estimated online in an unsupervised manner, and is removed from the supervectors using Nuisance Attribute Projection (NAP) The supervectors are then projected using ...

متن کامل

Speaker Diarization using Unsupervised Compensation of Within-Speaker Variability

This paper presents a novel framework for unsupervised compensation of within-speaker variability in the context of speaker diarization. Audio session is divided into overlapping short segments, each one parameterized by a GMM-supervector. For each session independently within-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisance attribute projection ...

متن کامل

Intra-session Variability Compensation for Speaker Segmentation

This paper addresses the problem of speaker segmentation in two speaker telephone conversations, proposing a segmentation approach based on factor analysis and a novel method for intra-session variability compensation to improve segmentation performance. The segmentation system is evaluated on the NIST Speaker Recognition Evaluation 2008 summed channel test condition, showing that intra-session...

متن کامل

Speaker Diarization in Personal Video Recordings Based on LDA and User Feedback

In this paper, we present the speaker diarization system which is used in personal video recordings. Speaker diarization begins by the extraction of relevant features from the input signal. Features are measurable characteristics which are important to the distinction between different classes. They should have low inter-class similarity and also low intra-class variability. So, LDA is used to ...

متن کامل

Exploiting Intra-Conversation Variability for Speaker Diarization

In this paper, we propose a new approach to speaker diarization based on the Total Variability approach to speaker verification. Drawing on previous work done in applying factor analysis priors to the diarization problem, we arrive at a simplified approach that exploits intra-conversation variability in the Total Variability space through the use of Principal Component Analysis (PCA). Using our...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010